Model-free Adaptive Dynamic Programming for Optimal Control of Discrete-time Affine Nonlinear System

Authors

  • Zhongpu Xia
  • Dongbin Zhao
  • Huajin Tang
Abstract

This paper proposes a model-free approach, based on adaptive dynamic programming, for solving the infinite-horizon optimal control problem for discrete-time affine nonlinear systems. The approach uses an actor-critic structure: two multilayer perceptron neural networks approximate the state-action value function and the control policy, respectively. Policy iteration runs on data collected arbitrarily from any reasonable sampling distribution. In the policy evaluation phase, a novel objective function is defined for updating the critic network, so that the critic network converges to the solution of the Bellman equation directly rather than iteratively. In the policy improvement phase, the action network is updated to minimize the output of the critic network. The two phases alternate until the control policy stops improving, at which point the optimal control policy is obtained. Two simulation examples demonstrate the effectiveness of the approach.
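The alternation between policy evaluation and policy improvement described in the abstract can be illustrated with a much simpler stand-in. The sketch below is not the paper's method: it assumes a scalar linear system (a special case of an affine system), a quadratic cost, and a quadratic critic fitted by least squares in place of the two multilayer perceptrons and the paper's novel objective function. All names (`collect_samples`, `evaluate_policy`, `improve_policy`, `policy_iteration`) and all numeric constants are hypothetical. The learner only sees sampled transitions and never uses the system coefficients.

```python
import numpy as np

def collect_samples(n, a=0.8, b=1.0, q=1.0, r=1.0, seed=0):
    """Draw arbitrary (state, action) pairs and record cost and next state.

    Assumed toy system: x_next = a*x + b*u, stage cost q*x^2 + r*u^2.
    The coefficients are used only here, to simulate data.
    """
    rng = np.random.default_rng(seed)
    x = rng.uniform(-1.0, 1.0, n)
    u = rng.uniform(-1.0, 1.0, n)
    x_next = a * x + b * u
    cost = q * x**2 + r * u**2
    return x, u, cost, x_next

def features(x, u):
    """Quadratic critic features, so that Q(x, u) = features(x, u) @ w."""
    return np.stack([x**2, x * u, u**2], axis=1)

def evaluate_policy(x, u, cost, x_next, k):
    """Policy evaluation for the linear policy u = -k*x.

    Solves the Bellman equation Q(x, u) = cost + Q(x_next, -k*x_next)
    for the critic weights w in one least-squares step on the batch.
    """
    d = features(x, u) - features(x_next, -k * x_next)
    w, *_ = np.linalg.lstsq(d, cost, rcond=None)
    return w

def improve_policy(w):
    """Policy improvement: minimise Q(x, u) over u in closed form.

    dQ/du = w[1]*x + 2*w[2]*u = 0  gives  u = -(w[1] / (2*w[2])) * x.
    """
    return w[1] / (2.0 * w[2])

def policy_iteration(data, k0=0.0, max_iter=50, tol=1e-10):
    """Alternate the two phases until the policy gain stops changing.

    k0 must be a stabilising gain for the sampled system (assumed here).
    """
    k = k0
    for _ in range(max_iter):
        w = evaluate_policy(*data, k)
        k_new = improve_policy(w)
        if abs(k_new - k) < tol:
            return k_new, w
        k = k_new
    return k, w

data = collect_samples(200)
k_learned, w_learned = policy_iteration(data)
```

Because this toy critic represents the value function of a linear policy exactly, the learned gain can be checked against the solution of the scalar discrete-time Riccati equation for the same coefficients. With neural-network approximators, as in the paper, no such closed-form check is available.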

Similar resources

Hybrid model predictive control of a nonlinear three-tank system based on the proposed compact form of piecewise affine model

In this paper, a predictive control based on the proposed hybrid model is designed to control the fluid height in a three-tank system with nonlinear dynamics whose operating mode depends on the instantaneous amount of system states. The use of nonlinear hybrid model in predictive control leads to a problem of mixed integer nonlinear programming (MINLP) which is very complex and time consuming t...

On the Design of Nonlinear Discrete-Time Adaptive Controller for damaged Airplane

airplane in the presence of asymmetric left-wing damage. Variations of the aerodynamic parameters, mass and moments of inertia, and the center of gravity due to damage are all considered in the nonlinear mathematical modeling. The proposed discrete-time nonlinear MRAC algorithm applies the recursive least squares (RLS) algorithm as a parameter estimator as well as the error between the real ...

Adaptive Consensus Control for a Class of Non-affine MIMO Strict-Feedback Multi-Agent Systems with Time Delay

In this paper, a distributed adaptive controller is designed for a class of unknown non-affine MIMO strict-feedback multi-agent systems with time delay under a directed graph. The controller design is based on the dynamic surface control method. In the design process, radial basis function neural networks (RBFNNs) were employed to approximate the unknown nonlinear functions. S...

Optimal discrete-time control of robot manipulators in repetitive tasks

Optimal discrete-time control of linear systems has been presented already. There are some difficulties to design an optimal discrete-time control of robot manipulator since the robot manipulator is highly nonlinear and uncertain. This paper presents a novel robust optimal discrete-time control of electrically driven robot manipulators for performing repetitive tasks. The robot performs repetit...

Iterative learning identification and control for dynamic systems described by NARMAX model

A new iterative learning controller is proposed for a general unknown discrete time-varying nonlinear non-affine system represented by NARMAX (Nonlinear Autoregressive Moving Average with eXogenous inputs) model. The proposed controller is composed of an iterative learning neural identifier and an iterative learning controller. Iterative learning control and iterative learning identification ar...

Publication date: 2014